Concepts and techniques in data mining and multidisciplinary
applications. Topics include data cleaning and transformation;
classification and predictive modeling; clustering; association
analysis; performance analysis and scalability; data mining in
advanced database systems, including text, audio, and images; and emerging themes and future
challenges. Students will gain hands-on experience and learn how to
implement and apply various data mining algorithms.
Section 001: Tuesday/Thursday 1:30-2:45pm
Exploratory Hall L003
Section 002: Tuesday/Thursday 3:00-4:15pm
Horizon Hall 2016
Dr. Jessica Lin
Email: jessica [AT] gmu [DOT] edu
Office Hours: Tuesday/Thursday 11am-12pm
Madhukar Vongala
Roshan Dhakal
There will be 4 competition-style programming assignments in Python. Competition winners will get 1% extra credit added to the final grade. You are allowed a grace period of 3 days past the deadline, with a 10% penalty for each day. You will receive no credit if the homework is not submitted by the end of the grace period. Note that internet trouble is not a valid excuse for submitting late, so you should plan to submit a few hours early to avoid last-minute technical difficulties.
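As a rough illustration only, the late policy can be sketched in Python. This is not an official grading script: the function name `late_adjusted_score` is hypothetical, and the sketch assumes that any fraction of a day counts as a full late day and that each late day deducts 10% of the score earned. Check with the instructor for the exact rules.

```python
import math


def late_adjusted_score(raw_score: float, hours_late: float) -> float:
    """Return the score after the late penalty (hypothetical helper).

    Assumptions, not stated in the syllabus:
    - any part of a day past the deadline counts as one full late day;
    - each late day deducts 10% of the score that was earned.
    """
    if hours_late <= 0:
        return raw_score  # submitted on time, no penalty

    days_late = math.ceil(hours_late / 24)
    if days_late > 3:
        return 0.0  # past the 3-day grace period, no credit

    # Integer percentages keep the arithmetic free of rounding surprises.
    return raw_score * (100 - 10 * days_late) / 100


if __name__ == "__main__":
    for hours in (0, 5, 30, 72, 73):
        print(hours, late_adjusted_score(90, hours))
```

Under these assumptions, a submission that earned 90 points and arrived 30 hours late counts as two days late and keeps 80% of its score.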
There will be quizzes throughout the semester covering lectures and readings, and one final exam. The purpose of the quizzes is to help you keep up with the lecture materials, so they are typically shorter and easier than the final exam. The final exam is comprehensive. All exams are closed-book, and they must be taken at the scheduled time unless prior arrangements have been made with the instructor. Missed exams cannot be made up. The lowest quiz grade will be dropped.
You will be able to earn class participation credit through
in-class activities.
Required: Introduction
to Data Mining by Pang-Ning Tan, Michael Steinbach, and Vipin
Kumar (click on the link for the companion website)
The GMU Honor Code is in effect at all times. In addition, the CS Department has further honor code policies regarding programming projects, which are detailed here. Some examples can be found here. Any deviation from the GMU or the CS Department Honor Code is considered an Honor Code violation. All assignments for this class are individual unless otherwise specified. ChatGPT and other generative AI models may NOT be used as an assistant on the assignments in this course.
If you have a documented learning disability or other condition that may affect academic performance, make sure this documentation is on file with the Office of Disability Services, and then discuss accommodations with the professor.